AI evaluation standards AI News List

AI evaluation standards AI News List | Blockchain.News

AI News List

List of AI News about AI evaluation standards

Time	Details
2025-08-05 17:26	OpenAI Study: Adversarial Fine-Tuning of gpt-oss-120b Reveals Limits in Achieving High Capability for Open-Weight AI Models According to OpenAI (@OpenAI), an adversarial fine-tuning experiment on the open-weight large language model gpt-oss-120b demonstrated that, even with robust fine-tuning techniques, the model did not reach high capability under OpenAI's Preparedness Framework. External experts reviewed the methodology, reinforcing the credibility of the findings. This marks a significant advancement in establishing new safety and evaluation standards for open-weight AI models, which is crucial for enterprises and developers aiming to utilize open-source AI systems with improved risk assessment and compliance. The study highlights both the opportunities and the limitations of open-weight AI model deployment in enterprise and research environments (Source: openai.com/index/estimating-...). Source

Time

Details

2025-08-05
17:26

OpenAI Study: Adversarial Fine-Tuning of gpt-oss-120b Reveals Limits in Achieving High Capability for Open-Weight AI Models

According to OpenAI (@OpenAI), an adversarial fine-tuning experiment on the open-weight large language model gpt-oss-120b demonstrated that, even with robust fine-tuning techniques, the model did not reach high capability under OpenAI's Preparedness Framework. External experts reviewed the methodology, reinforcing the credibility of the findings. This marks a significant advancement in establishing new safety and evaluation standards for open-weight AI models, which is crucial for enterprises and developers aiming to utilize open-source AI systems with improved risk assessment and compliance. The study highlights both the opportunities and the limitations of open-weight AI model deployment in enterprise and research environments (Source: openai.com/index/estimating-...).

Source